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ABSTRACT 

In the linearization of systems of non-linear differential equations, we consider those 
systems which can be exactly transformed into the second order linear differential 
equation Y / -AY , -BY=0 where Y, Yj and Y^are n x 1 vectors and A and B are constant 
n x n matrices of real numbers. We use the 2n x 2n matrix jjjto transform the 
above matrix equation into the first order matrix equation X # = MX. We study 
specifically the matrix M and the conditions which will diagonalize or triangularize M. 
We indicate transformation matrices P and P 1 to accomplish this diagonalization or 
triangularization and how to use these to return to the solution of the second order 
matrix differential equation system from the first order system. 


We conclude with a study of the relationship between the diagonalization of M to that 
of the submatrices A and B. 
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Introduction 


(Szebehely, 1976), (Bond and Horn, 1978), and (Bond, 1982) have studied the 
linearization of systems of non-linear differential equations. In each of these cases 
the transformation of one system to the other involves square matrices A and B of 
order n. 


In the present paper we consider only those non-linear differential equation systems 
which car be exactly transformed into the second order linear differential equation 
system Y^-AY^BYsO where the matrices A and B are constant matrices of real 
numbers and where Y, Y*, and Y^are n x 1 vectors. 


We form the matrix M =j£ *Jwhich is a matrix of order 2n which arises in the standard 
transformation of the second-order linear equation to a first-order linear equation by 
the transformation 

y where Y is the vector (Y, , Y, , ..., Yn) and Y 1 = (y/ , Y^ , ..., Yn). 

Z^Y' 

J 

Observe that this transformation yields Z ( ; = Y / = Z x and Z^ = Y" - AZ- + BZ,. 

Written in matrix form we have 

R] ■' [* *]{y 

Letting X =^'jand M = ^ *Jthe equation Y w = AY + BY can be reduced to the linear 
first-order matrix equation X*= MX. 

This research is concerned with the following questions: 

(1.) What are the conditions on the matrix M which will allow us to solve the matrix 
differential equation system X # = MX. 
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(2.) Under the assumption that this system can be solved, how does one actually 
4 . recover the matrices to return to the original second order differential equation 

: system y"-Ay'- BY =0. 

f 

This paper shall try to answer these questions as completely as possible. 
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Theory 

The solution of the first-order linear differential equation system x' = MX depends 
heavily on the matrix M given. We consider two cases. The most satisfactory solution 
occurs when the matrix M is similar to a diagonal matrix and much of our research has 
centered upon what must be known about the submatrices A and B of M ^ ^jin order 
to assure that M can be diagonalized. We give some elementary linear algebra theory 
to familiarize everyone with the ideas. 

For a linear homogeneous transformation such as M above, there exist scalars i t -and 
vectors X. which satisfy the equation MX • = A-X- • The values of A- for which this 
equation is satisfied are called the eigenvalues of M and the vectors X ± which are fixed 
under the transformation M for each >1^ are called the eigenvectors. There are many 
other names associated with these values and vectors, some of which are 
characteristic values and vectors, latent values and vectors, and proper values and 
vectors. 

Clearly, the zero vector will always satisfy the equation MX = Ax for any A chosen. 
However, we desire to find nontrivial solutions to the problem. Rewriting the equation 
as (M-Al) X = 0 we see that any nontrivial vector X will satisfy the equation if and or ,y 
if det (M-Al) = 0 where det stands for the determinant of the matrix M-Al. 

The determinant of this is a polynomial equation in A* Since, in our case, M is of order 
2n, the polynomial equation f(A) = 0 will be of degree 2n, and hence there will be 2n 
eigenvalues associated with M. These values may or may not be distinct, and we shall 
discuss the consequences later in the paper. 
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For each distinct eigenvalue, X, satisfying f (A) = det (M- XI) = 0 there will exist at least 
one non-trivial eigenvector X satisfying (M-AD X = 0. (Stein, 1967). 
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The eigenvalues and eigenvectors play a central role in the solution of the differential 
equation x' = MX and this is our purpose in discussing them here. 

A matrix M is said to be similar to the matrix C if there exist a nonsingular matrix P 
» such that M = P CP _/ . 

The matrix M is diagonalizable if it is similar to a diagonal matrix 



where the large zeros indicate that all elements off the main diagonal are zeros. 


Suppose now that the matrix M of our system X* = MX is known to be diagonalizable, 
then there exists a nonsingular matrix P such that P MP=D (A,, ...» where the 
notation D (A,, ..., ) will always mean a diagonal matrix where the X,, ..., A ivl 

are the eigenvalues of M and they lie on the main diagonal of D. See Theorems 6.7.1 
and 6.8.1 of (Stein, 1967) for a proof of the above statement. 


If we now set X = PZ where P is the matrix which diagonalizes M, then X* = PZ / and Z 
= P _, X. So PZ' = x'= MX or Z ; - P - ' MX = (P _, MP) Z Thus z' = DZ where D is our 
, diagonal matrix. Using elementary differential equation theory, it is known that a 
solution of Z / = DZ is 


li 




Z = 


c,,e 
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So X = PZ will then solve the original system simply by multiplying the matrix P by the 

S 

vector Z. Thus it is of paramount importance first * determine whether or not the 
original matrix M can be diagonalized and if it can, then how does one obtain the 
matrix P which both diagonalizes M and gives the final solution for the X once we have 
Z as shown above. 


Now consider the second case where the matrix M is not diagonalizable. Every square 

matrix M is similar to a triangular matrix T with the eigenvalues of M as the diagonal 

elements of T. A constructive proof of this fact is given in (Stein, 1967) as Theorem 

6.8.5. The process again generates a nonsingular matrix P such that P *MP = T. Using 

the same construction of X = PZ given above, by elementary differential equation 

theory (Murdoch, 1957), one can solve the system for the original vector X = (X, , X^, 
X T 

...» X.V • If we assume the matrix T is upper triangular then after computing PZ, we 
wHl have X a<n = as in the preceding case, and in general depending on the 

multiplicity of the eigenvalues, if A is an eigenvalue of multiplicity r, then the solution 
for this case will be of the form ) where f>(t ) is a polynominal of degree r-1. 

Thus a solution is possible even in v this case although not as easy to obtain or use. If 

i > J*/ 

the A s are distinct then, for example, if , then the solution for 

X = C ^ , r _ t 

a»,-i -H-/ C T 


where the c«-j 's are the nonzero elements in the upper triangular matrix which 
correspond to each row i and column j position. One can use these solutions then to 
substitute in and solve for the next , and so on, until a complete solution is 
obtained. 
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Results 


We next describe the process of actually obtaining P which will diagonalize or 
triangularize the matrix to give us the solutions indicated above. We consider each of 
the cases separately. 

Suppose first that M is diagonalizable, then there exists a set of 2n linearly 
independent eigenvectors, say U, , U a , ...» U^and this is true regardless of whether or 
not any eigenvalue has multiplicity greater than one or not. See (Edelen, 1976) for a 
discussion of this property. If we suppose that U ( corresponds to A t , U a tol,, ..., U^to 
even if some of the A 's are equal, then the matrix P = (U ( , U a , ..., Uj„) where each 
column of P is a vector of length 2n will be nonsingular since these 2n vectors are 
linearly independent. We observe that if /L=Ay for instance, there will still be distinct 
eigenvectors and Uy which are linearly independent when M is diagonalizable. P 
nonsingular implies that it has an inverse and the product P”* MP will actually be a 
diagonal matrix with the eigenvalues down the main iagonal. 

We will address the process of determining the eigenvectors for a given \ later in the 
paper. 

We now consider the problem of finding the nonsingular matrix P when the matrix M is 
not diagonalizable. 

The process is given as follows. Process : 

(1.) Find an arbitrary eigenvector, X, for the first eigenvalue of your matrix. 
(We shall assume ...,/| Jn are successively on the diagonal from top left to 

lower right.) 
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(2.) Form P, = (X, ,£>, ..., ) provided that P so formed is nonsingular. 

(3.) Compute P’ 1 and calculate the product P,’* MP, where M is our original 
matrix. 

(4.) Using 1 MP, we now have A,in upper left corner. Call the submatrix formed 
by crossing out the row and column which contains 1,, M , . If Mj is a 2x2 matrix 
skip to step 10. Otherwise compute an eigenvector for A x using M, (not M, but it 
will be the same eigenvalue, A. , ai for M). 

(5.) Form P a = (X a ,Pj, ...,^) where X a is the 2n-l eigenvector found in step 4. 

(6.) Compute P a ' and calculate the product P.,* M, P a . This matrix is of order 
2n-l. 

(7.) P a * M, P A will now have in the upper left hand corner and zeros in the 

column belo> it. 

(8.) Form the submatrix M^by crossing out the row and column containing . 

(9.) If is 2x2 goto step 10. If not continue the process as illustrated in steps 
5-8 until a submatrix which is 2x2 is finally obtained. 

(10.) When Mj is reached which is 2x2 the process will be changed as follows: 
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Process (Continued) 


(a) Compute an eigenvector using Mj for . 

(b) Compute an eigenvector using Mj for A, n . 

(c) Let Pj t , = (X^,, X, M ). Note it does not contain standard basis vectors as in 
previous cases. 


(11.) Finally form the matrix P as follows: 


P = P,' 


l 

0 

H 

1 


1 

H 

o 

i 


0 

1 

o 

# 


# # * 

o 

» 


1,= i , s \j> fj 5 efc. 


(12.) Compute P * . Then the final product P 'MP will triangularize M and leave 
the eigenvalues on the diagonal of P -, MP. Note that this form will create an 
upper triangular matrix. 

Note : Each successive eigenvector is found by reducing Mj -Al to canonical form 

for j = o, ...,an-l. Then multiplying the reduced matrix by the appropriately sized 
vector made up of the Xj 's to be in the eigenvector. Solve this system of linear 
equations by choosing appropriate values for the arbitrary X's. You can choose X^ = 
1 for arbitrary X's if so desired. This then will make up the necessary eigenvectci for 
that A . 
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In determining eigenvectors for an M which is diagonalizable, there are two cases to 
consider. The first case is that for which al' eigenvalues are distinct. In this case use 
Gaussian elimination to reduce the matrix M - A I to canonical form (we use this name 
to imply that the matrix has ones on the main diagonal and zeros off the main diagonal 
in so far as possible and always zeros below the main diagonal). Other authors use the 
phrase'row echelon’form to describe this. 

If C is the row echelon form of M - Al» then setting CX = 0, where X is the 2n x 1 
/ .T 

vector X = (X, , X A , ..., XjJ , will allow us to obtain the elements X «> x *» •’ X z„ of 

the eigenvector for A by solving the homogeneous system CX - 0. Any convenient 
choice of the arbitrary X's will give us an eigenvector forA. Since al! A 's are distinct 
in this case, then each of the eigenvectors will be linearly independent so the matrix P 
formed from the eigenvectors will be nonsingular. 

The second case is that where some of the eigenvalues have multiplicity greater than 
one. In this case, if A^is an eip^nvalue of multiplicity t- then there will be *;-linearIy 
independent eigenvectors associated with each such Recall that we are assuming 
the diagonalizability of M at this point, otherwise we would not know that this is 
possible. For each i, one can obtain distinct eigenvectors by choosing different values 
for the arbitrary variables in the equation CX = 0 when solved. Each of these 
eigenvectors will not only be linearly independent of each other but also linearly 
independent of eigenvectors obtained from distinct eigenvalues. See the proof of 
Theorem 6.8.4 of (Stein, 1967) for a proof of this fact. 

Thus all 2n of the eigenvectors so obtained will be linearly independent and thus the 
m^. ‘rix P formed from the eigenvectors as previously indicated will be nonsingular. 
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We have made the assumption in the above work that M is diagonalizable. Consider 
now the question of what conditions on M will guarantee that M is diagonalizable. 


The most well-known theorem says that every real symmetric matrix is diagonalizable 

r _ “1 


(Stein, 1967). See Theorem 6.8.6. However, our matrix M= 
symmetric so we seek other criterion. 


0 x 
B * 


is in general not 


* A second theorem (6.8.2) indicates that M is diagonalizable if and only if M has 2n 
linearly independent eigenvectors. 

The difficulty with using this theorem is that we would like to determine whether or 
not M is diagonal' ’-able before we compute the eigenvectors so we will know wl her 
to use the diagonalization or triangular ization process which we have described above 
in finding the matrix P and its inverse. 


One further theorem (6.8 .4.) tells us M is diagonalizable if and only if for each A’, the 
multiplicity of A; is equal to k-2n-r where r the rank of M-J;I. (Stein, 1967). 

Although this theorem is better than those above and works on all matrices, it still 
requires checking the multiplicity of every against the rank of M-A,’I which could 
be a rather formidable task. 
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Thus we look specifically at the form of our matrix M =[j t0 'feternnine, it possible, 
conditions on B and A which will help us decide the diagonalizability of M. To this 
end, consider the following little known theorem from (Hohn, 1964). 
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If P is m x n , Q is m x n, R is n x m and S is n x n and nonsingular, then 

f> Ql w 

= det S -det (P-QS R). 


det 


* S 


We now assume that P, Q, R, and S above are all square of size n x n and also that SQ 


= QS. We can then prove the following theorem. 
P $1 

det s = det (PS-QR) 

L K \p cf 

Proof: By the first theorem above jdet 


S S 


= det S- det (P-QS R). Then using the 
well-known theorem that det AB = det A • det B when A and B are square n x « 
matrices we apply this to the right member above to have det S' det (P-QS _, R) = det 
(5(P-QS R)) = det (SP-5QS ’r). Since we are assuming SQ = QS we have 
det (SP-SQS^R) = det (SP-QSS' 1 R) = det (SP-QR) since S-ST' =1. 


We now apply this theorem to our matrix M = ° 1 . Since I, the identity matrix, 

6 A 

commutes with every matrix of the same size, we have IA = AI so det M = |OA - B'l| = 
det (-B) = (-ir.det(B). 


Considering the eigenmatrix M- yll we have det (M-Al) = det 
det ((- AlK A- 4l)-B) = det (-K+A -B) = det U*I- A A-B)~ 


-XT X 
6 *-Al\ 


Our purpose in looking at det (M- \ I) is to observe that det (M- A 1) = 0 is the 
characteristic equation of the matrix M, but since det (M- JO-detUVAA-B), then 
det ( A 1 1- Aa-B) = 0 is also the characteristic equation for M, and it is a polynomial 


equation in terms of the matrices A and B which are submatrices of the original 


matrix M. Assume first A-0, then M = 
theorem just proved (S = 0 is not nonsingu 


0 r 
8 o 


and even though we cannot use the 


ar) we can still see that det M = (-lf'det B 
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by Laplace's expansion. To ..:r. npute the determinant of M-J I in this case we have det 

Ri il 

and since -Al is nonsinguiar, "he theorem does apply in this case to 


(M -A I) =det 


6 ~Al 


give us 


det (M- A l)= det (A 1-b) = (-l)*det (B-A I). Since for a fixed n, (-1)*' is a 
constant, it will not affect the eigenvalues in the polynomial characteristic equation. 
Thus if /f; » i = 1, n are the eigenvalues of B, then A; = will be the eigenvalues 
of M. (Note: det (B-/t;I) = 0 implies det (M-A/I) = 0.) 


Assume now that B is diagonalizable. Then for each*;, i =1,2, ..., n there exists an 
eigenvector Ui , i = 1, 2, ..., n such that the set (U, , U* , ..., Un) of eigenvectors is 
linearly independent. Thus let T =(111,112, ..., Un) be the matrix of eigenvectors as 
columns in the matrix. Then T BT = D R , Jt t , ..., will be the diagonal matrix of 
eigenvalues. 


Let S = 


\ o 
o 


be the matrix where the 


Vs here are the positive square roots of then's and thus are half of the eigenvalues of 
M. Consider first the case where AjO for all i = 1, 2, ..., n. Then S" 1 exists and 


is given by S 


* >, 0 
0 




In this case let P = 



fir" 

or P MP s ft 


r r 

fo i ] 

[T5-' 

TS’ 1 ' 



r^r's’-s 

t -; 

’ bo ' 

m — 

T 

-T 

- X 

-Tiers' 1 * 5 

s' 1 - s _ 


Computing T 1 BT S 1 we obtain: 


W. o 1 

X O ' 


O ' 


'>A, O ■ 

1 

y 

o 

o''i 

L J 


1 

i 

o 
1 


0''K 

« * \ 
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pi. 


A,0 

0 "J 


= S. Since T~ BT diagonalizes B to its eigenvalue matrix. 


Thus P~* 


MP = b 


5 + 5 

-s+s 



Hence this choice for P will diagonalize M where the positive square roots of are 
down the diagonal of the upper-left n x n matrix and the negative square roots of H‘ are 
the diagonal elements of the lower right n x n matrix. Thus we have shown how to 
diagonalize M given any diagonalizable B provided the eigenvalues of B are nonzero. 
Thus M is diagonalizable when B is diagonalizable provided none of B's eigenvalues are 
zero, and provided that A = 0. 


Now suppose A = ft I, where ft is any nonzero scalar. We then obtain 

det (M -A I) = det (A 1 I - A {/S I) -B) 

= det(( A - A /5 ) I - B) 

= (-if. det (B-(A-Azi)I). 

Again let h-% i = 1 , 2, ..., n be the eigenvalues of B. We then have A,.- bft ~a^ or 

= 0, for i = 1, 2, n. By solving these quadratic equations we again obtain 
the eigenvalues for M from the eigenvalues for B. 


If we assume a <0 for all i and that all of the eigenvalues for B are distinct, then the 
eigenvalues for M will be distinct, and thus M is diagonalizable. 
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